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The purpose of this article is to synthesize the extant research on peer-medi¬ 
ated interventions (PMIs) with English language learners (ELLs) in kinder¬ 
garten through Grade 12. Fourteen studies that were published in 
peer-reviewed journals from 1983 to 2013 were examined in terms of study 
characteristics, the effects on academic outcomes, study quality, and overall 
effectiveness. Structured, heterogeneous grouping was used in the 10 peer 
pairing and 4 collaborative/cooperative grouping PMIs with ELLs. Eight of 
the 14 studies included high methodological quality. Overall, PMIs with 
ELLs are associated with medium to large effects on measures of phonemic 
awareness, vocabulary, and comprehension when compared to teacher-medi¬ 
ated comparison conditions. More research on PMIs with ELLs in high 
school and across core content areas, particularly mathematics, is war¬ 
ranted. Implications and future research for PMIs with ELLs are discussed. 

Keywords: English language learners, systematic review, peer-mediated 
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English language learners (ELLs) are the fastest growing subgroup of students 
in the public education system in the United States (Genesee, Lindholm-Leary, 
Saunders, & Si Christian, 2005; Kindler, 2002). In 2012-2013, about 10% of 
public education students, or nearly 5 million students, participated in ELL pro¬ 
grams compared to 8.7% of students, or about 4 million students in 2002-2003 
(Kena et ah, 2015). On the National Assessment of Educational Progress reading 
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measure, ELLs underperform in comparison to their non-ELL peers (Snyder & 
Dillow, 2009-2013). From 2002 to 2013, fourth-grade and eighth-grade non- 
ELLs averaged from 39 to 44 scale points higher than ELLs on the National 
Assessment of Educational Progress reading measure (U.S. Department of 
Education, 2002-2013). The persistent discrepancy between ELLs’ and non- 
ELLs’ reading performance exemplifies the increased academic challenges the 
majority of ELLs face (Francis, Rivera, Lesaux, Kieffer, & Rivera, 2006). Low 
academic achievement of ELLs has severe implications at the high school level as 
well. ELLs are less likely to pass their state’s exit/graduation exam (Sullivan 
et al., 2005) and are more likely to drop out of school (Ruiz-de-Velasco, Fix, & 
Clewell, 2000) than their non-ELL peers. 

As schools implement the Common Core State Standards (CCSS), it is critical 
that school leaders and teachers understand what evidence-based instructional 
approaches and interventions might be used to improve ELLs’ college and career 
readiness skills (DiCerbo, Anstrom, Baker, & Rivera, 2014; National Governors 
Association [NGA] & Council of Chief State School Officers [CCSSO], 2010). 
Mastery of academic language skills is arguably the single most important key to 
academic success for ELLs (Francis et al., 2006). Proficiency in academic lan¬ 
guage affects ELLs’ ability to comprehend and analyze texts, their ability to write 
and express themselves effectively, and their acquisition of academic content 
(Echevarria, Short, & Powers, 2006; Francis et al., 2006; Gersten et al., 2007). 
Skills such as these are included in the CCSS as students master the standards in 
reading, writing, speaking, listening, and language in each content area (NGA & 
CCSSO, 2010). 

A limited but expanding body of empirically supported instructional approaches 
and interventions is available to help educators meet the needs of all ELLs, espe¬ 
cially those with academic difficulties. The Institute of Education Sciences prac¬ 
tice guides provide information on evidence-based literacy and English language 
instructional practices for ELLs in the elementary grades (Gersten et al., 2007) 
and for teaching academic content and literacy to ELLs in elementary and middle 
school (Baker et al., 2014). Gersten et al. (2007) provided five recommendations 
to improve ELLs’ literacy: (a) screen for reading problems and monitor progress, 
(b) provide intensive small-group reading interventions, (c) provide extensive and 
varied vocabulary instruction, (d) develop academic English, and (e) schedule 
regular peer-assisted learning opportunities. Researchers suggest that teachers of 
ELLs should devote approximately 90 minutes a week to instructional activities 
in which pairs of students at different ability levels or different English language 
proficiencies work together in a structured fashion to practice and extend aca¬ 
demic material that was taught previously (Gersten et al., 2007). Likewise, in a 
more recent practice guide, Baker et al. (2014) recommended small-group instruc¬ 
tion with ELLs struggling in areas of literacy and English language development. 
In sum, small-group instruction and peer practice, often characterized as peer- 
mediated interventions (PMIs), are recommended practices for improving reading 
achievement (What Works Clearinghouse [WWC], 2007a, 2010b) and English 
language development of ELLs (WWC, 2007a, 2007b). 

Many researchers use the term PMIs when referring to either peer pairing/ 
peer tutoring or collaborative/cooperative grouping (Maheady, Harper, & Sacca, 
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1988; Wexler, Reed, Pyle, Mitchell, & Barton, 2015). In peer pairing, students 
work in partners (dyads) to tutor each other (Greenwood, Delquadri, & Hall, 
1989). In collaborative and cooperative grouping, students work in small groups 
to accomplish goal-directed activities (Puzio & Colby, 2013). For the purpose 
of this review, PMI is defined as a structured intervention strategy in which 
peers work in dyads or small groups to practice an academic skill or learn aca¬ 
demic content. Furthermore, the term ELL is used to include similar labels (e.g., 
limited English proficient [LEP], Spanish-speaking bilinguals [SSB]) reported 
by other authors. 

PMIs provide ELLs with frequent opportunities to develop academic language 
and content area learning (Heron & Harris, 2001; Mercer & Mercer, 2001). 
Moreover, PMIs have the potential to effectively and rapidly increase English 
language development, particularly decontextualized language concepts with 
high degrees of cognitive challenge (Gersten & Baker, 2000). Heron, Welsch, and 
Goddard (2003) suggested that using students to support instruction is perhaps the 
most underused and valuable classroom resource. By using students to support 
instruction, teachers can provide individualized learning opportunities during the 
independent-practice phase of the instructional cycle (Greenwood, Carta, & 
Kamps, 1990; Hudson, Lignugaris/Kraft, & Miller, 1993). Although there is 
emerging evidence that supports the use of PMIs with ELLs, there is an extensive 
literature base that supports the use of PMIs for improving learning of students 
with disabilities in mathematics (Baker, Gersten, & Lee, 2002; L. S. Fuchs, Fuchs, 
& Kams, 2001; L. S. Fuchs, Fuchs, Yazdian, & Powell, 2002), reading (Allor, 
Fuchs, & Mathes, 2001; D. Fuchs, Fuchs, & Burish, 2000), science (Bowman- 
Perrott, Greenwood, & Tapia, 2007; Mastropieri et al., 2006), social studies 
(Mastropieri, Scruggs, Spencer, & Fontana, 2003; Scruggs, Mastropieri, & 
Marshak, 2012), and spelling (Greenwood et ah, 1989; Harper, Mallette, Maheady, 
Parkes, & Moore, 1993). 

One procedural hallmark of PMIs is matching students strategically. Two stra¬ 
tegic approaches used to match students are heterogeneous grouping and homo¬ 
geneous grouping. Heterogeneous groups include students of various ability 
levels, such as high performers matched with low performers (Lou et ah, 1996). 
For example, in peer pairing interventions teachers may rank-order students in the 
class based on a specific variable (e.g., reading ability level) with the highest per¬ 
forming students in one column and the middle to lowest performing students in 
the next column. Students are then paired by rank order in the two columns, that 
is, the highest performing student with the middle-performing student, the second 
highest performing student with the second middle-performing student, and so on, 
until all students are paired (D. Fuchs, Fuchs, Mathes, & Simmons, 1997). 
Similarly, in cooperative learning, teachers may rank-order students in the class, 
divide the list into three to five columns, and select one student from each column 
to create a heterogeneous group. On the other hand, homogeneous groups include 
students who perform at similar levels (Slavin, 1987). For instance, Slavin and 
Karweit (1985) grouped students on the basis of an initial basic skills mathematics 
test in each class. Students were rank-ordered and divided into a high group and a 
low group. Teachers differentiated materials and moved through the curriculum at 
a faster pace with the high group as compared to the low group. 
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Three systematic reviews conducted previously highlight the effects of PMIs 
on students’ outcomes. In a meta-analysis, Cole (2013) reviewed the effectiveness 
of peer-mediated instruction on ELLs’ oral and written language outcomes. In the 
two other reviews, Lou et al. (1996) and Puzio and Colby (2013) highlighted the 
effectiveness of cooperative and collaborative learning interventions on non- 
ELLs’ academic outcomes. 

Cole (2013) reported that peer-mediated instruction is highly effective for 
improving ELLs’ oral and written language. Cole suggested that peer-mediated 
instruction was more effective when the students’ native language was used dur¬ 
ing instruction. In addition, ELLs performed better in general education classes 
where they had access to their native English-speaking peers and language sup¬ 
port services. While Cole focused on oral and written language outcomes, the 
present review addresses all academic outcomes. Cole included international 
journals, dissertations, and technical reports, many of which were not peer- 
reviewed. In contrast, the present review includes only peer-reviewed studies con¬ 
ducted in U.S. schools. Finally, Cole included collaborative learning studies in 
which students were simply divided into groups and given a task. The students 
then established their own structure for accomplishing the task. In the present 
review, studies were excluded if they did not describe replicable tutoring or coop¬ 
erative learning procedures. Notably, only 3 of the 32 studies that were presented 
in Cole’s meta-analysis met the inclusion criteria for this systematic review 
(Almaguer, 2005; Calhoon, Al Otaiba, Greenberg, King, & Avalos, 2006; Prater 
& Bemudez, 1993). 

Lou et al. (1996) examined the effects of PMIs on student achievement at 
the elementary, secondary, and postsecondary levels. First, the authors ana¬ 
lyzed studies in which researchers compared a reciprocal peer tutoring format 
(i.e., students alternated between the roles of tutor and tutee) to students work¬ 
ing individually. Overall, small effects were found favoring reciprocal peer 
tutoring compared to individual student work in Grades 1 to 12. In their second 
analysis, Lou et al. compared reciprocal peer tutoring with homogeneous 
groups, to reciprocal peer tutoring with heterogeneous groups. Small effects 
were reported in favor of homogenous groups. Importantly, the authors did not 
disaggregate outcome data to report the effects of PMI groupings on ELLs’ 
achievement. 

Puzio and Colby (2013), in a meta-analysis of 18 research studies, examined 
the effectiveness of cooperative and collaborative learning interventions on 
literacy outcomes. They reported that the cooperative and collaborative activ¬ 
ity structures used in Cooperative Integrated Reading and Composition (Slavin, 
1987) and other cooperative and collaborative programs improved student lit¬ 
eracy achievement. Students who had access to cooperative and collaborative 
interventions had significantly higher literacy achievement scores than stu¬ 
dents who did not have access to cooperative and collaborative interventions. 
Similar to Lou et al. (1996), Puzio and Colby (2013) did not disaggregate out¬ 
come data to report the effects of cooperative or collaborative learning on 
ELLs’ literacy outcomes. 
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Rationale and Research Questions 

PMIs (peer pairing, cooperative or collaborative grouping) are often used to 
improve academic outcomes for students with and without disabilities. Despite 
the available research on PMIs, there is no systematic review that addresses the 
effectiveness of PMIs on the academic outcomes of ELLs. The purpose of this 
systematic review is to synthesize the available peer-reviewed literature to deter¬ 
mine the effects of PMIs on the academic outcomes of ELLs in Grades K-12. The 
following research questions are addressed: 

Research Question 1: What are the characteristics of PMIs with ELLs in 

Grades K-12? 

Research Question 2: What are the effects of PMIs on improving academic 

outcomes of ELLs in Grades K-12? 

Research Question 3: What is the methodological quality of the available 

PMI studies that address academic outcomes of ELLs in Grades K-12? 

Method 

Search Procedure and Corpus of Studies 

A four-step search procedure was carried out that included (a) an electronic 
search, (b) an ancestral search of all articles included in this review, (c) an 
ancestral search of three recent reviews, and (d) a hand search of nine peer- 
reviewed journals. An electronic search was conducted of Academic Search 
Premier, ERIC, and PsycINFO to locate studies published between 1983 and 
2013. This 30-year time frame encompasses research that adequately represents 
the current status of the field (see WWC, 2014). Terms and root words related 
to peer-mediated (peerpartner*, peer tutor*, peer mentor*, peer mediat*, peer 
support, peer pair*, peer interaction, peer learn*, peer-to-peer, peer instruct*, 
reciprocal teaching, reciprocal peer tutoring, peers as tutors, peer-assist*, 
tutor-tutee, collaborat*, cooperative, response group*) were used in combina¬ 
tion with ELL-related terms and root words ( English as a second language and 
ESL, limited English proficient, LEP, English language learner, ELL, English 
learner, EL, culturally linguistically diverse, CLD, language minority, bilin¬ 
gual, structured English immersion, dual language learner, second language 
learner) to locate relevant articles. 

The initial electronic search yielded 4,914 articles. The abstracts were divided 
among four raters and studies were selected based on the following criteria: 

1. Participants were identified as ELLs. To be inclusive of the most com¬ 
monly used terms to describe ELLs, we included studies with participants 
identified as ESL and LEP. Studies were excluded that included ELLs but 
did not disaggregate outcome data for ELLs (e.g., Klingner, Vaughn, & 
Schumm, 1998). 

2. Studies were accepted that included an experimental or quasi-experimen- 
tal treatment-comparison design, a single-case design (SCD), or a preex- 
perimental single group design (Campbell & Stanley, 1966). 
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3. Studies were included in which students implemented a structured PMI 
(e.g., peer pairing/peer tutoring; cooperative/collaborative grouping) as a 
treatment. Studies were excluded if students were not taught to implement 
a systematic PMI procedure with peers. For instance, studies were 
excluded if peers were simply assigned to work collaboratively in a peer 
pairing or peer grouping arrangement and left to structure the group on 
their own (e.g., MacArthur, Ferretti, & Okolo, 2002). 

4. Studies were accepted that reported an academic outcome (e.g., reading 
fluency, reading comprehension, content knowledge). Studies were 
excluded that only reported oral language development outcomes (e.g., 
number of utterances; August, 1987). 

5. Studies were accepted that were published in English and were conducted 
in U.S. schools. 

Twelve studies met the inclusion criteria. A search of the references of the three 
previous reviews of PMIs (Cole, 2013; Lou et al., 1996; Puzio & Colby, 2013) 
yielded an additional study. One additional study was also located through the 
ancestral search of all included studies. A hand search of the following nine jour¬ 
nals from which a majority of the studies were located was completed for 2012 to 
2013: Bilingual Research Journal, Exceptional Children, Journal of Learning 
Disabilities, Journal of Research on Educational Effectiveness, Journal of Special 
Education, Learning Disability Research & Practice, Learning Disability 
Quarterly, Remedial and Special Education, and TESOL Quarterly. No additional 
studies were located through the hand search. In sum, a total of 14 studies, reported 
in peer-reviewed journals between 1983 and 2013, met the criteria for inclusion in 
this synthesis. A majority of the studies were conducted between 2005 and 2009. 

Coding Procedures 

An extensive code sheet was developed based on elements specified in the 
What Works Clearinghouse Procedures and Standards Handbook (Version 3.0; 
WWC, 2014). The code sheet was used to organize the following essential infor¬ 
mation: (a) participants, (b) methodology, (c) intervention and comparison infor¬ 
mation, (d) clarity of causal inference, (e) measures, and (f) findings. The code 
sheet included a combination of forced-choice items (e.g., research design, assign¬ 
ment method), open-ended items (e.g., age of participants, duration of interven¬ 
tion, PMI type), and written description of the treatment and comparison 
conditions. 

Four raters were trained on the use and interpretation of items from the code 
sheet for a total of 5 hours. Each rater independently coded the same article and 
point-by-point agreement was calculated until interrater reliability of 100% was 
attained on one article. The remaining studies were divided among raters. Each 
article was independently coded and double coded by a second rater to check for 
accuracy. The mean interobserver agreement (IOA) index (i.e., agreements 
divided by agreements plus disagreements) was 93% with a range of 86% to 
100%. When disagreements occurred, raters reached consensus on how to code 
the particular item. 
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Effect Size Calculation 

To provide additional quantitative information effect sizes were calculated 
using Cohen’s d. Cohen’s d was calculated as the difference between the mean 
posttest score of the participants in the intervention condition and the mean post¬ 
test score of the participants in the comparison condition divided by the pooled 
standard deviation (Cooper, Hedges, & Valentine, 2009). Effect sizes were 
obtained for all group design studies in which data were available. We interpreted 
an effect size of d = 0.20 as small, d = 0.50 as medium, and d = 0.80 as large 
(Cohen, 1988). 


Quality Standards 

The seven standards used to assess the methodological quality of experimental, 
quasi-experimental, and SCD studies were derived from Gersten et al. (2005), 
Homer et al. (2005), and WWC (2014). For experimental and quasi-experimental 
studies, this review included randomization and attrition from the WWC group 
design standards. Additional methodological quality indicators included whether 
the researchers described fidelity of implementation procedures, thoroughly 
described intervention and comparison conditions, used multiple outcome mea¬ 
sures, used standardized measures, and computed an effect size (Gersten et al., 
2005). 

Quality indicators for SCD studies were developed based on both the Horner 
et al. (2005) and WWC (2014) design and evidence standards. SCD studies 
were evaluated on the strength of the research design to ensure adequate inter¬ 
nal validity and the strength of evidence through visual analysis as proposed by 
the WWC. The WWC design standards include the following: (a) the interven¬ 
tion was systematically manipulated, (b) the dependent variable was measured 
repeatedly, (c) IOA was assessed for a minimum of 20% of all sessions, (d) 
IOA met a minimum of 80% agreement/reliability across all sessions, (e) an 
adequate number of replications were demonstrated, and (f) a minimum of 
three data points were present in each phase. Additionally, we included Horner 
et al.’s (2005) criteria to evaluate whether the fidelity of the independent vari¬ 
able was measured during the intervention phase. For this review, high-quality 
group and SCD studies were those that met a minimum of six of the seven 
quality indicators. 


Results 

Data Analysis Plan 

This synthesis includes a range of study designs and PMI types. First, we 
address Research Question 1 by synthesizing study characteristics, including 
intervention features, and highlight similarities and differences across the corpus 
of studies. Second, we summarize the effects of PMIs on academic outcomes 
(Research Question 2) by intervention type (i.e., peer pairing and collaborative/ 
cooperative). Finally, we report the methodological quality of the PMI studies 
(Research Question 3), and summarize the overall findings of the high-quality 
studies of PMIs with ELLs. 
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Characteristics of Peer-Mediated Interventions With ELLs 
in Grades K—12 

The characteristics of the PMI studies are summarized in Table 1. The 14 stud¬ 
ies included a total of 1,777 students of which 1,092 (61%) were ELLs. A total of 
934 ELLs participated in the treatment-comparison studies (1,595 students par¬ 
ticipated overall), 145 ELLs participated in the SCD studies, and 13 ELLs partici¬ 
pated in the single-group study (37 students participated overall). Sample sizes 
ranged from 12 to 507. Slightly less than half of the student sample (n = 866) were 
in the elementary grades (K-6; n = 11 studies), and 915 students were in second¬ 
ary grades (7-12; n — 3 studies). Third graders were included the most often 
(n = 6 studies), and kindergarteners and eighth graders were included the least 
often (n = 1 study). PMIs with ELLs were evaluated with students in every grade 
level from Grades K-8. No researchers investigated the effects of PMIs with 
ELLs in Grades 9 to 12. 

In nine studies, researchers reported ethnicity: 66% (« = 560) of the partici¬ 
pants were Hispanic; 7% ( n = 57) were White; 6% (/? = 51) were Black; 1% (n = 
10) were Asian; and 19% (n = 164) were not reported or were reported as non- 
Hispanic. In six studies, researchers included students with disabilities. In three of 
the studies (Klingner & Vaughn, 1996, 2000; Saenz, Fuchs, & Fuchs, 2005), 
researchers specifically indicated that they included ELLs with a learning disabil¬ 
ity (LD) in the treatment condition (n = 36). 

Language Proficiency 

In seven studies, the authors reported on ELLs’ performance on language pro¬ 
ficiency measures. Researchers in four studies (i.e., Almaguer, 2005; Klingner & 
Vaughn, 2000; Madrid, Canas, & Ortega-Medina, 2007; Madrid, Canas, & 
Watson, 2003) included the Language Assessment Scale (LAS; De Avila & 
Duncan, 1984). Researchers in the remaining three studies (Calhoon, A1 Otaiba, 
Cihak, King, & Avalos, 2007; McMaster, Kung, Han, & Cao, 2008; Saenz et al., 
2005) reported ELLs’ performance on different language proficiency measures. 
These included the Woodcock-Munoz Language Survey-Revised (WMLS; 
Woodcock & Munoz-Sandoval, 1993), the IDEA Proficiency Test (Ballard, 
Dalton, & Tighe, 1995), and the Student Oral Language Observation Matrix 
(SOLOM; Minnesota Department of Education, 2003). 

Academic Skill Area 

PMIs were used in five academic skill areas: reading (n = 6 studies), social 
studies (/? = 3 studies), spelling (n = 3 studies), science (n = 1 study), and writing 
(n = 1 study). No researchers investigated the effects of PMIs with ELLs in 
mathematics. 

Language of Instruction 

In four studies, researchers reported that English was the language of instruc¬ 
tion. In seven studies, both English and Spanish were used during instruction 
(Calderon, Hertz-Lazarowitz, & Slavin, 1998; Calhoon et al., 2007; Greenwood, 
Arreaga-Mayer, Utley, Gavin, & Terry, 2001; Klingner & Vaughn, 1996, 2000; 
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Characteristics of peer-mediated intervention studies 
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TABLE 1 (continued) 
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Vaughn et al., 2009: Study 1 and Study 2). For instance, in the Klingner and 
Vaughn (2000) study, the authors indicated that vocabulary words and key con¬ 
cepts were translated into Spanish. It is possible that Spanish was spoken in the 
classroom even though English was the language of instruction. Klingner and 
Vaughn (1996) stated that even though instruction was provided in English, 
students were encouraged to use their native language (i.e., Spanish). Similarly, 
Almaguer (2005) and Calderon et al. (1998) reported that English was the pri¬ 
mary language of instruction in a bilingual program and Greenwood et al. 
(2001) specified that ELLs were educated in an ESL self-contained classroom 
with partial immersion. Spanish was used during vocabulary instruction in 
Vaughn et al. (2009: Study 1 and Study 2). In three studies, researchers did not 
report the language of instruction (Madrid et al., 2003; Madrid et al., 2007; 
Prater & Bemudez, 1993). 

Intervention Duration and Session Length 

The corpus of studies included a range of intervention durations and length of 
intervention sessions. In 12 of the 14 studies, researchers reported the number of 
sessions implemented (range = 27 sessions to 75 sessions; Klingner & Vaughn, 
1996, Madrid et al., 2003; Madrid et al., 2007, respectively). In seven studies, 
researchers reported that interventions lasted 15 weeks or longer. The longest 
intervention was implemented daily for the entire school year (Calderon et al., 
1998). Prater and Bemudez (1993) reported the shortest intervention duration, 3 
weeks. In 13 studies, researchers reported the number of sessions implemented 
per week (range = 2 days per week to daily sessions). In six studies, researchers 
implemented PMIs daily. In contrast, Klingner and Vaughn (2000) implemented 
their PMI two to three times per week, while Greenwood et al. (2001) imple¬ 
mented their PMI three to four times per week. In 13 of the 14 studies, researchers 
reported the length of individual sessions. In four studies, the intervention session 
ranged from 12 to 20 minutes, while in 9 studies, sessions ranged from 30 to 40 
minutes. 

Peer-Mediated Grouping Arrangements 

Researchers reported varying numbers of students that were grouped to partici¬ 
pate in PMIs. In 10 of the 14 studies, students were grouped in dyads. Calderon 
et al. (1998) reported that students were grouped in teams of four and Prater and 
Bemudez (1993) reported that peers were grouped in teams of four to five. 
Klingner and Vaughn (1996, 2000) reported that students were grouped in teams 
of six to seven. 

Researchers matched students heterogeneously in all the studies in this review. 
In all peer pairing studies, researchers strategically matched students. For exam¬ 
ple, in eight studies researchers rank-ordered all the students in the class on ability 
level (e.g., reading achievement) and paired higher achieving students with lower 
achieving students. In two studies, Vaughn et al. (2009) matched ELLs with their 
non-ELL peers based on reading test scores. Finally, in three of the four collabora¬ 
tive/cooperative grouping studies, researchers ensured ELL peers were included 
in each group. Calderon et al. (1998) did not describe a specific process for how 
students were heterogeneously matched. 
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Effects of Peer-Mediated Interventions on Academic 
Outcomes of ELLs in Grades K—12 

The outcomes of each study are organized by the type of PMI and summarized 
in Supplemental Table SI (in the online version of the journal). First, we describe 
the findings from the peer pairing intervention studies and then the collaborative/ 
cooperative grouping intervention studies. Within each section, we report effect 
sizes from the treatment-comparison design studies followed by findings from 
the preexperimental single-group and SCD studies, when applicable. 

Peer Pairing 

In 10 of the 14 studies, researchers used a peer pairing/peer tutoring format. 
Vaughn et al. (2009: Study 1 and Study 2) covered the same material using the 
same textbook during the same time in the treatment and comparison conditions. 
The treatment in Study 1 and in Study 2 included vocabulary instruction, brief 
videos and purposeful discussion, use of graphic organizers, and structured peer 
pairing as components of a PMI designed to improve reading of social studies 
content. In both studies, Vaughn et al. (2009) reported large effects on researcher- 
developed measures of comprehension for ELLs in the treatment conditions 
(effect size [ES] = 0.72, 0.84, respectively) as compared to ELLs in the compari¬ 
son condition. On the vocabulary measures, medium effects were reported for 
ELLs in Study 1 (ES = 0.64) and in Study 2 (ES = 0.50) as compared to ELLs in 
the comparison condition. In conclusion, in both studies, Vaughn et al. (2009) 
reported medium to large effects on researcher-developed measures of compre¬ 
hension and vocabulary for ELLs compared to ELLs in the comparison 
condition. 

Almaguer (2005) implemented dyad reading as the treatment using the same 
curriculum and instructional minutes as the teacher-mediated comparison condi¬ 
tion. In the dyad reading condition, the student with the higher reading level 
selected the basal reader and read aloud, while the student with the lower reading 
level read in unison. Medium to large effects were found for ELLs on the 
Comprehensive Reading Assessment Battery (CRAB) standardized measure of 
fluency (ES = 0.69) and reading comprehension (ES = 0.70) as compared to ELLs 
in the comparison condition. Also, small effects were found on the close reading 
comprehension measure (ES = 0.33). 

Peer-assisted learning strategies. In four treatment-comparison studies, research¬ 
ers compared PALS to school-implemented reading activities. The researchers 
implemented variations of PALS dependent on their participants’ grade level, 
including K-PALS, first-grade PALS, and PALS in Grades 3 to 6. Saenz et al. 
(2005) engaged peers in Grades 3 to 6 in Partner Reading with story retell, para¬ 
graph shrinking by summarizing the main points in paragraphs, and making story 
predictions. Dyads progressively added peer-mediated reading components dur¬ 
ing the intervention and received points for accurate and high-quality work. In 
general, ELLs in the PALS condition outperformed their ELL peers in a teacher- 
mediated instruction comparison condition. Effect sizes ranged from medium to 
very large on three measures of the CRAB, maze (ES = 0.40), words correct (ES 
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= 0.60), and questions correct (ES = 1.02). In a secondary analysis, data were 
disaggregated by four ELL learner types: high achievers, average achievers, low 
achievers, and learning disabled. ELLs of all learner types, with the exception of 
the average achievers on the maze, outperformed the comparison condition on 
each of the three CRAB measures. 

In two studies, Calhoon et al. (2006, 2007) strategically matched first graders 
in dyads and had them focus on beginning reading activities, sounds and words, 
and story sharing including previewing, predicting, taking turns while reading, 
and retelling with peers. In both studies, ELLs received the district’s mandated 
core reading program in the teacher-mediated comparison group with supplemen¬ 
tal repeated reading and phonics instruction using grade-appropriate materials 
(Calhoon et al., 2006) and with supplemental whole-group reading (Calhoon 
et al., 2007). Calhoon et al. (2006) reported mixed effects for ELLs on the 
Dynamic Indicators of Basic Early Literacy Skills oral reading fluency (ES = 
-0.06), nonsense word fluency (ES = 0.74), and phoneme segmentation fluency 
(ES = 0.73) as compared to the comparison condition. Mixed effects were also 
found for ELLs on the same measures in Calhoon et al. (2007), oral reading flu¬ 
ency (ES = 0.38), nonsense word fluency (ES =1.31), and phoneme segmentation 
fluency (ES = -0.06), and very large effects were reported on Dynamic Indicators 
of Basic Early Literacy Skills letter naming fluency (ES = 1.18). Notably, there 
were twice as many ELLs in the 2007 study compared to the 2006 study. 

McMaster et al. (2008) compared two treatment conditions of K-PALS (with 
ELLs and with non-ELLs, respectively) with typical school practice (with ELLs). 
In the treatment conditions, dyads practiced phonics activities, sentence reading, 
and error correction, in addition to teacher-mediated phonemic awareness activi¬ 
ties. ELLs who participated in K-PALS outperformed ELLs in the control condi¬ 
tion on three standardized measures, spelling (ES = 0.04), word identification (ES 
= 0.11), and word attack (ES = 0.22). On researcher-developed measures of pho¬ 
nemic awareness, rapid letter sounds, and oral reading, effect sizes ranged from 
0.10 to 0.69 favoring ELLs in the K-PALS condition compared to ELLs in the 
control condition. 


Classwide peer tutoring (CWPT). In three SCD studies, researchers imple¬ 
mented CWPT and examined outcomes on researcher-developed spelling mea¬ 
sures. Greenwood et al. (2001) implemented CWPT in a reciprocal tutoring 
format where the tutee read and wrote the word while the tutor read the words 
and provided corrective feedback and praise when appropriate. In addition to a 
researcher-developed spelling measure, Greenwood et al. assessed ELLs’ sight 
word vocabulary in one classroom. Over the course of the entire study, ELLs 
demonstrated a 60% mean increase (from pretest to posttest) in the four classes 
that addressed spelling and in the first-grade class of nonreaders that addressed 
sight word vocabulary. 

Madrid et al. (2003) compared passive CWPT, active CWPT, and teacher- 
mediated instruction. In passive CWPT, the tutee watched and listened while the 
tutor read and wrote the spelling words, provided prompts to the tutee to pay 
attention, and gave praise when appropriate. In active CWPT, the tutee wrote and 
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spelled the spelling words while the tutor read the words and provided corrective 
feedback and praise when appropriate. Teachers focused on the same reading, 
writing, and spelling activities as used in the CWPT conditions and implemented 
them across the 5 school days. ELLs in the passive CWPT and active CWPT con¬ 
ditions demonstrated almost equivalent spelling scores from pretest to posttest 
(51% score increase compared to 50% score increase, respectively) whereas ELLs 
in the teacher-mediated condition gained 27%. 

Madrid et al. (2007) replicated their 2003 study but varied how students earned 
points for their accurate work in the CWPT conditions. ELLs in the competitive 
CWPT teams accumulated points only for their individual team, whereas ELLs in 
the cooperative CWPT teams accumulated points for all participants across all 
teams. Similar to the 2003 study, teachers focused on the same activities as in the 
CWPT conditions and implemented them across the 5 school days. ELLs in coop¬ 
erative CWPT teams demonstrated an 81% increase on spelling scores from pre¬ 
test to posttest while ELLs in the competitive CWPT teams demonstrated a 67% 
gain whereas ELLs in the teacher-mediated condition demonstrated a 22% 
increase. 

In summary, the peer pairing intervention studies included peer tutoring, PALS, 
and CWPT. Researchers in four studies included ELLs in about half of their sam¬ 
ple whereas researchers in the other six studies included only ELLs. Overall, 
ELLs who participated in peer pairing interventions demonstrated gains on stan¬ 
dardized and researcher-developed measures of phonemic awareness, fluency, 
spelling, vocabulary, and reading comprehension. 

Collaborative/Cooperative Interventions 

In four treatment-comparison studies, researchers examined three different 
collaborative/cooperative PMIs. Calderon et al. (1998) examined Bilingual 
Cooperative Integrated Reading and Composition (BCIRC) compared to a control 
intervention that focused on round robin reading, and independent workbook 
activities and occasionally used cooperative learning strategies. ELLs in the 
BCIRC condition outperformed ELLs in the control condition on a state standard¬ 
ized measure of reading (ES = 0.58) and language (ES = 0.29). 

Prater and Bemudez (1993) implemented a peer-mediated writing process to 
compose a weekly paper. In the peer response groups, students provided guidance 
to their peers, including topic selection, commentary on strengths and weak¬ 
nesses, and help with revisions and editing. In the comparison condition, students 
worked individually to select a topic, compose a paper, and revise their drafts for 
submission based on the teacher’s edits. ELLs in the peer response groups outper¬ 
formed ELLs in the comparison condition on composition (ES = 0.19), number of 
idea units (ES = 0.30), number of words (ES = 0.63), and number of sentences 
(ES = 0.37). 

Collaborative strategic reading (CSR). Kdingner and Vaughn (1996) com¬ 
pared reciprocal teaching plus cross-age tutoring to reciprocal teaching plus 
cooperative learning. Reciprocal teaching focused on predicting, brainstorm¬ 
ing, clarifying word understanding, main idea identification, summariza¬ 
tion, and questioning. In reciprocal teaching plus tutoring, dyads practiced 
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comprehension strategies and in reciprocal teaching plus cooperative learning 
students practiced comprehension strategies in cooperative groups. ELLs in the 
reciprocal teaching plus cooperative learning condition outperformed ELLs in 
the reciprocal teaching plus tutoring condition on a standardized measure of 
reading comprehension (ES = 1.42). However, ELLs in the reciprocal teach¬ 
ing plus tutoring condition outperformed ELLs in the reciprocal teaching plus 
cooperative learning condition on a researcher-developed passage comprehen¬ 
sion measure (ES = 0.35). In the Klingner and Vaughn (2000) single-group 
study, ELLs engaged in a reading comprehension strategy that focused on read¬ 
ing text aloud, vocabulary, and discussion in collaborative groups with rotat¬ 
ing roles. On two researcher-developed vocabulary measures for two science 
chapters, ELLs demonstrated large vocabulary gains from pretest to posttest 
(ES = 1.08 and 1.05, respectively). 

In summary, the collaborative/cooperative grouping studies included a collab¬ 
orative writing process, BCIRC, and CSR. Klingner and Vaughn (2000) included 
ELLs in about half of their sample whereas researchers in the other three studies 
included only ELLs. ELLs who participated in collaborative/cooperative PMIs 
demonstrated gains on standardized measures of language and reading compre¬ 
hension and researcher-developed measures of writing, vocabulary, and reading 
comprehension. 

Methodological Quality of Peer-Mediated Interventions That 
Address Academic Outcomes of ELLs in Grades K—12 

Researchers used an experimental or quasi-experimental design in 10 studies, 
an SCD in 3 studies, and a preexperimental single-group design in one study. We 
highlight the methodological quality of the experimental and quasi-experimental 
studies in Table 2 and the quality of the SCD studies in Table 3. Klingner and 
Vaughn’s (2000) preexperimental single-group design is subject to numerous 
threats to internal validity (e.g., history, maturation, testing, and instrumentation) 
and was not evaluated for quality. 

A total of 8 of the 14 studies, all featuring treatment-comparison research 
designs, met at least six of seven quality indicators and were rated as studies with 
high methodological quality (i.e., Almaguer, 2005; Calhoon et al., 2006, 2007; 
Klingner & Vaughn, 1996; McMaster et al., 2008; Saenz et al., 2005; Vaughn 
et al., 2009: Study 1 and Study 2). Of the eight studies with high methodological 
quality, researchers in seven studies used random assignment at the student, class¬ 
room, or teacher level. Furthermore, researchers in all these studies reported 
information on fidelity of implementation. In all eight high-quality treatment- 
comparison studies, researchers described the treatment conditions thoroughly, 
reported the use of multiple outcome measures, and provided an effect size or 
enough data to compute an effect size, and attrition was computable and at accept¬ 
able rates (<30%). In six of the eight high-quality studies, researchers used at least 
one standardized measure. 

Overall, the three SCD studies in this review have serious methodological 
design flaws and do not meet our minimum of six of seven quality indicators. 
Madrid et al. (2003) and Madrid et al. (2007) systematically manipulated the inde¬ 
pendent variable by alternating the students through three different treatments 
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teachers to condition. A check mark indicates that evidence was present. A dash indicates that evidence was not present. 
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Note. IV = independent variable; DV = dependent variable; IOA = interobserver agreement; FOI = fidelity of 
implementation; Tx = treatment; WWC = What Works Clearinghouse. A check mark indicates that evidence was 
present. A dash indicates that evidence was not present. 


over 5 weeks. They also measured the dependent variable repeatedly by adminis¬ 
tering pretests and posttests to measure spelling achievement on five separate 
occasions. Although Madrid and colleagues reported reliability in their 2003 
study, they failed to report IOA percentages in their 2007 study. Moreover, in both 
studies, Madrid et al. reported fidelity during training and noted that the research¬ 
ers and classroom teachers independently approached the tutoring dyads to ensure 
tutoring procedures were being implemented but did not report fidelity data dur¬ 
ing the treatment phases. Finally, Madrid et al. (2003; Madrid et al., 2007) did not 
include graphs for each participant. They reported mean spelling results across 
students in bar graphs during the three conditions under investigation. Thus, nei¬ 
ther study met the criteria to assess effectiveness of the intervention for individual 
students. 

In the other SCD, Greenwood et al. (2001) employed an AB design (A = base¬ 
line implementation of CWPT learning management system; B = CWPT learning 
management system plus consultation, progress-based decision making, and feed¬ 
back) across five classrooms. Greenwood and colleagues measured the dependent 
variable repeatedly, reported acceptable levels of IOA, included acceptable levels 
of data points per phase, and used a 40-item checklist to assess fidelity of the 
intervention. However, Greenwood and colleagues did not systematically manip¬ 
ulate their peer-mediated treatment. Although the replications across classrooms 
are notable, without a reversal/withdrawal phase or a multiple baseline design, the 
AB design is subject to numerous threats to internal validity. Although we were 
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unable to determine the effectiveness of this study, the authors presented class 
growth spelling mean improvement, and in one class, mean vocabulary improve¬ 
ment, from pretest to posttest. 

None of the SCD studies reviewed in this article met the minimum number of 
quality design standards to assess the strength of evidence of a causal relationship 
using the WWC (2014) evidence standards. Thus, we did not synthesize the 
results of these studies or the preexperimental single-group design to determine 
the overall evaluation of PMIs in the next section. 

Overall Evaluation of Peer-Mediated Interventions With ELLs 

There is an emerging body of literature of PMIs with ELLs that includes high- 
quality experimental studies (n = 6) and quasi-experimental studies (n = 2). Seven 
of the eight studies that had high methodological quality ratings included peer pair¬ 
ing that resulted in small to large effect sizes favoring PMI conditions. These out¬ 
comes are consistently found with standardized measures and researcher-developed 
measures that resulted in mostly medium to large effects on phonemic awareness, 
vocabulary, and comprehension outcomes when compared to teacher-mediated 
comparison conditions (ES: 0.33-1.31). Also, these PMIs were often associated 
with improved fluency and decoding for ELLs when compared to teacher-mediated 
comparison conditions (ES: 0.10-0.69). On a less consistent basis, researchers 
report a wide range of effects on standardized measures of phoneme segmentation 
(ES: -0.06-0.73), fluency (ES: -0.06-0.69), and comprehension (ES: 0.33-1.42; 
Almaguer, 2005; Calhoon et al., 2006,2007; Klingner & Vaughn, 1996). Researchers 
also report positive effects on researcher-developed measures of vocabulary (ES: 
0.50-0.64) and comprehension (ES: 0.35-0.84; Klingner & Vaughn, 1996; Vaughn 
et al., 2009). Last, when separated by PMI model, 70% of the peer pairing studies 
had high methodological quality ratings, whereas only 25% of the collaborative/ 
cooperative grouping studies had high methodological quality ratings (see Table 2). 
In general, the studies that did not meet our high quality standards resulted in similar 
outcomes as the high-quality studies. 

Discussion 

In this systematic review of 14 studies, we analyzed the effects of PMIs imple¬ 
mented with ELLs in Grades K-12. In general, over half the studies reviewed in 
this article were considered to be of high methodological quality and the PMIs 
resulted in favorable academic outcomes for ELLs. This finding is discussed 
below relative to the methodological quality and effectiveness of the PMI studies 
within the context of the two types of PMI models, peer pairing and collaborative/ 
cooperative grouping. We then discuss the PMI characteristics that resulted in 
improved academic outcomes of ELLs. We conclude with implications and future 
research of PMIs with ELLs. 

Methodological Quality and Effectiveness of Peer-Mediated 
Interventions 

Many of the PMI studies included in this review met the established high qual¬ 
ity criteria (see Tables 2 and 3). From these studies we can conclude that PMIs are 
effective interventions with ELLs for improving phonemic awareness, fluency, 
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comprehension, and vocabulary outcomes. Specifically, we found that PALS 
mostly produced medium to large effects on phonemic awareness, fluency, and 
comprehension outcomes for ELLs in Grades K, 1, and 3 to 6. Similarly, the 
WWC (2010a) reported that PALS had potentially positive effects on reading 
achievement for ELLs in Grades 3 to 6. Finally, we found that structured PMIs 
that included elements of PALS or CWPT (e.g., paired reading) resulted in large 
effects on reading comprehension and medium effects on vocabulary (e.g., 
Vaughn et al., 2009) and fluency (e.g., Almaguer, 2005). Thus, it appears that 
PMIs that include paired reading aloud, repeated reading, reading with question¬ 
ing, previewing, predicting, reciprocal reading, retelling, and error correction are 
likely to produce positive academic outcomes for ELLs in phonemic awareness, 
fluency, vocabulary, and comprehension. 

We cannot confidently conclude that PMIs with ELLs improve their spelling 
performance, written compositions, and statewide tests in reading and language 
given the methodological quality of the reviewed research. Furthermore, we can¬ 
not draw conclusions about the effectiveness of CWPT with ELLs due to the 
methodological quality of the available research. The WWC (2010a) also could 
not draw conclusions about the effectiveness of CWPT with ELLs because no 
studies using CWPT with ELLs met the WWC quality criteria. Additional studies 
conducted with high methodological quality are required to confirm the effective¬ 
ness of PMIs, specifically CWPT, with ELLs in spelling and writing and on state¬ 
wide measures of reading and language. 

In previous reviews, researchers reported that substantially more studies 
included researcher-developed measures than standardized measures (Stenhoff & 
Lignugaris/Kraft, 2007; Wexler et al., 2015). In contrast, we found that standard¬ 
ized measures were used in half of the peer pairing studies and half of the collab¬ 
orative/cooperative grouping PMI studies. Researchers in few studies in this 
review reported ELL outcomes on both standardized and researcher-developed 
measures. Standardized test outcomes are generally more distal to the target out¬ 
comes than researcher-developed measures; therefore, large effect sizes on these 
measures suggest a broader achievement impact than the intervention specific 
outcomes typically measured using researcher-developed measures. In this 
review, researchers reported large effects of PMIs on standardized measures of 
reading comprehension in three high-quality studies. In four additional high-qual¬ 
ity studies, researchers reported mixed effects on standardized measures of pho¬ 
nemic awareness and fluency. This strengthens the conclusion that PMIs with 
ELLs will result in improved reading comprehension; however, we assert that 
more research should explore the effects of PMIs with ELLs on standardized mea¬ 
sures of phonemic awareness and fluency. 

Measurement of fidelity is a study design feature that researchers are more 
recently including in their research designs. In fact, Okilwa and Shelby (2010) and 
Wexler et al. (2015), in their syntheses of PMIs with students with disabilities, 
reported that nearly all the studies they reviewed included fidelity of implementa¬ 
tion. Similarly, in most studies, we found that researchers reported fidelity of 
implementation. Only a few years earlier, Stenhoff and Lignugaris/Kraft (2007) 
reported that fewer than half of the researchers in their review of PMIs with stu¬ 
dents with disabilities reported fidelity of implementation. Although the inclusion 
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of measurement of fidelity is an important quality indicator that researchers are 
increasingly accounting for in their design, there are various ways to measure and 
report fidelity in PMI research. For instance, some researchers reported fidelity of 
implementation collected during tutor training (e.g., Calhoon et al., 2006), whereas 
other researchers reported fidelity data collected during program implementation 
(e.g., Greenwood et al., 2001), and some researchers reported fidelity data col¬ 
lected during both training and implementation (e.g., Calhoon et al., 2007). To 
better understand the quality of treatment integrity, there may be value in measur¬ 
ing fidelity of the proposed tutor training procedures to evaluate the effectiveness 
of specific tutor training protocols. Likewise, there may be value in measuring 
fidelity of the tutors’ behavior during the implementation of students delivering 
PMIs to evaluate the effectiveness of specific procedures on tutees’ outcomes. It is 
also informative to measure the PMI features implemented in the comparison con¬ 
dition, such as reported in Saenz et al. (2005) and Vaughn et al. (2009). 

When separated by PMI types, we reviewed 10 peer pairing studies and 4 col¬ 
laborative/cooperative grouping studies. Direct comparisons between peer pairing 
and collaborative/cooperative grouping arrangements are limited because of non¬ 
overlapping dependent variable constructs and measures. Notably, only Klingner 
and Vaughn (1996) directly compared reciprocal teaching plus cooperative group¬ 
ing to reciprocal teaching plus tutoring, which resulted in improved comprehen¬ 
sion outcomes for ELLs in both conditions, but significantly larger effects on a 
standardized measure that favored ELLs in the reciprocal teaching plus coopera¬ 
tive grouping condition. Although limited, this study should be replicated in other 
content areas and grade levels to explore whether this is a generalized outcome or 
one that is unique to this instructional situation. Furthermore, studies of direct 
comparisons between peer pairing and collaborative/cooperative grouping are 
needed to determine if one arrangement is more effective than the other. 

Similar to our findings, Cole (2013) reported that PMIs, despite the type, result 
in positive outcomes for ELLs. Cole suggested that group size appears to be less 
important than the opportunity afforded to peers to actively engage with the con¬ 
tent during PMIs. Cole concluded that “teacher-centered, monologic instruction 
creates an atmosphere of silence that removes an important opportunity to learn 
for ELLs” (p. 163). Allowing students to interact with their peers to discuss 
instructional objectives and learning material provides students with increased 
opportunities to learn (Santos, Darling-Hammond, & Cheuk, 2012). Other 
researchers also recommend that teachers provide ELLs with opportunities to 
work in pairs and in small groups to improve their literacy, English language 
development, and academic content (Baker et al., 2014; Gersten et al., 2007). 

Although there is limited high-quality collaborative/cooperative grouping 
research available, BCIRC shows promise for improving ELLs’ reading and lan¬ 
guage outcomes on statewide tests in Grades 2 and 3 and Peer Tutor Response 
Groups are associated with improved writing for ELLs in Grade 4 (Calderon 
et al., 1998; Prater & Bemudez, 1993). The WWC (2007a) found that BCIRC had 
potentially positive effects on reading achievement and English language devel¬ 
opment for ELLs in Grades 2 and 3. In the only collaborative/cooperative group¬ 
ing study of high methodological quality, Klingner and Vaughn (1996) 
implemented CSR with ELLs and produced large effects on comprehension. 
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Additional, high-quality research is warranted to evaluate the effectiveness of col¬ 
laborative/cooperative grouping PMIs with ELLs and corroborate the findings 
reported in this review. 

Peer-Mediated Intervention Characteristics 

There are various characteristics of PMIs, including intervention features, that 
were consistently identified across the studies in this review. These systematic 
implementation procedures included specific strategies for presenting content to 
peers, providing feedback to peers, and managing time and task features. Other 
common features of PMIs with ELLs across all studies included strategic hetero¬ 
geneous grouping of students and reciprocal tutoring. Wexler et al. (2015) reported 
similar findings, suggesting that these are common PMI features. We found that 
less common features of PMIs with ELLs include the use of a reward system for 
motivation (n = 4 studies), and Greenwood et al. (2001) used a computer software 
tool to help teachers implement the PMI. Wexler et al. (2015) also reported that 
only one study in their PMI review included technology. More research is needed 
to evaluate whether motivational systems and technology increase the effective¬ 
ness and efficiency of PMIs with ELLs. 

Grade Level 

A majority of the studies were conducted in the elementary grades and no stud¬ 
ies were conducted in high school. Implementation of PMIs with ELLs in the 
elementary grades could be a result of recommendations that teachers prioritize 
the development of formal or academic English during instruction as early as pos¬ 
sible (Francis et al., 2006; Gersten et al., 2007). Students as young as of kinder¬ 
garten age can learn peer-assisted learning techniques if the routines are simple 
and explicitly taught (McMaster et al., 2008). As ELLs reach upper elementary 
grades, they can learn how to provide feedback and assist each other in English 
language vocabulary development (Calderon et al., 1998). These instructional 
strategies are critical to improve academic outcomes for ELLs across Grades 
K-12 (Francis et al., 2006). Notably, a substantial gap in the available literature is 
the evaluation of PMIs with ELLs in high school. 

Content Area 

Considering the overlap of language and literacy instructional goals for ELLs, 
it is not surprising that in the majority of the studies researchers addressed areas 
of English language arts such as reading, spelling, or writing. Researchers tar¬ 
geted social studies and science in fewer studies, and in the studies we reviewed, 
researchers did not implement PMIs in mathematics. Thus, there is a clear need to 
address the effects of PMIs with ELLs in mathematics and other content areas 
such as science and social studies. 

Language of Instruction 

Researchers implemented PMIs with English-only instruction or a combina¬ 
tion of English and Spanish instruction. Researchers did not report that other lan¬ 
guages were spoken during the implementation of PMIs with ELLs. Several 
researchers reported that although English was the language of instruction, 
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materials were created in Spanish and students were encouraged to use their 
native language to help facilitate understanding of concepts. ELLs who received 
instruction in Spanish, to some degree, demonstrated improved outcomes in 
nearly all the academic areas represented in this review (i.e., phonemic aware¬ 
ness, fluency, vocabulary, and comprehension). Similarly, ELLs who received 
English-only PMI instruction demonstrated improved results on measures of pho¬ 
nemic awareness, fluency, and comprehension. It is possible, however, that ELLs 
spoke Spanish even though researchers reported that English was the official lan¬ 
guage of instruction. Additional research is needed to understand the extent to 
which the language of instruction influences ELLs’ outcomes across a breadth of 
academic areas. 

Language Proficiency 

ELLs’ scores on standardized measures that reflect language proficiency were 
reported inconsistently across studies. Moreover, in the studies in which language 
proficiency scores were reported, researchers employed various assessments 
(e.g., LAS, SOLOM, WMLS). To make language proficiency comparisons across 
studies, researchers must either use identical language proficiency tests or concur¬ 
rent validity studies must be conducted to equate the various language proficiency 
scores. For example, McMaster et al. (2008) and Saenz et al. (2005) are the only 
high-quality studies that included language proficiency scores. However, differ¬ 
ent language measures were used (i.e., SOLOM, WMLS) and no concurrent 
validity studies could be located to support the comparison of ELLs’ language 
proficiency scores as a result of participating in PMIs. Thus, it is not clear to what 
extent ELLs across studies had the same or different levels of language profi¬ 
ciency. Given the role of English language proficiency in learning new skills in 
English (Cardenas-Hagan, Carlson, & Pollard-Durodola, 2007; Goldenberg, 
2008), quantitative data, such as scores on language proficiency measures, are 
needed to better understand the variability in ELLs’ language proficiency and to 
determine the effectiveness of PMIs with ELLs who have different levels of lan¬ 
guage proficiency. 

Group Size 

Strategic matching was used in all peer pairing (dyads) studies and in two of 
the four collaborative/cooperative grouping (small groups) studies. In the other 
two studies, the authors divided up ELLs among all the collaborative/cooperative 
groups. Although we can assert that heterogeneous groupings are associated with 
positive, academic outcomes for ELLs, we do not know if heterogeneous group¬ 
ings are critical to the success of PMIs. Additional research is needed to determine 
if homogeneous groupings with ELLs will result in outcomes that are similar to 
those observed with heterogeneous groupings. 

Instructional Features and Training 

Gersten et al. (2007) recommended that elementary teachers of ELLs should 
devote approximately 90 minutes a week to instructional activities in which dyads 
at different ability levels or different English language proficiencies work together 
on academic tasks in a structured fashion to practice and extend material already 
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taught. Researchers in all 14 studies in this review provided the recommended 
intensity within the week, regardless of the variability in weeks of instruction. 
However, in six studies, researchers employed multicomponent packages that 
included teacher-mediated instruction. Although the instructional package was 
implemented for at least 90 minutes per week, it was not possible to determine 
exactly how much time peers actually tutored or worked collaboratively within the 
PM1. Thus, in these studies, it was not possible to isolate the effects of the PMI from 
the other components implemented during the intervention. In the future, research¬ 
ers should report the number of minutes peers engage in the PMI activity to deter¬ 
mine the session length and duration associated with the largest academic gains for 
ELLs. This information is especially important to distinguish the effects of PMIs 
with ELLs separate of and in combination with teacher-mediated instruction. 

Another feature of PMIs that is associated with improved academic outcomes 
includes instructional training prior to implementing PMIs. According to Stenhoff 
and Lignugaris/Kraft (2007), PMIs should include training before students begin 
the tutoring intervention. Researchers in the majority of studies included instruc¬ 
tional training for ELLs and monitoring/management training for teachers. Based 
on the available, yet limited information, ELLs received training that included 
modeling, corrective feedback, and practicing PMI procedures. Training for 
teachers typically included classroom observations with feedback and coaching 
from research staff. Maheady, Harper, Mallette, and Winstanley (1991) demon¬ 
strated that teachers could learn the essential components of a tutoring program in 
as little as 90 minutes, and that once trained, they implemented the program with¬ 
out additional assistance. It is not clear how much instructional training is required 
for ELLs to sufficiently learn how to implement the PMI to produce positive 
outcomes or the necessary amount of teacher training with the most essential fea¬ 
tures to support ELLs’ implementation of PMIs. 

Summary of Implications and Future Research 

Overall, there are eight studies that have high methodological quality ratings 
in which PMIs resulted in small to large effect sizes on academic outcomes for 
ELLs in kindergarten through Grade 8. In future studies, researchers should 
investigate PMIs that include standardized measures of academic skills, particu¬ 
larly in mathematics, to continuously increase our confidence in the effective¬ 
ness of PMIs with ELLs, especially for those students in Grades 9 to 12. More 
research on PMIs implemented with students with disabilities who are ELLs, in 
addition to comparisons among high-achieving, average-achieving, and low- 
achieving ELLs, is encouraged with more detailed information on ELLs’ English 
language proficiency given its relationship to academic achievement (Goldenberg, 
2008; Gottlieb, 2006). Future research should examine the effects of PMIs when 
implemented with students over longer periods of time. Another essential area of 
future research is to directly compare the effects of the PMI types, peer pairing 
versus collaborative/cooperative, on ELLs’ academic outcomes. Last, given the 
adoption of the CCSS, it may be beneficial to evaluate the effects of PMIs on 
ELLs’ listening, speaking, and language to assess the extent to which PMIs 
improve reading, writing, listening, speaking, and language across the disci¬ 
plines as intended in the CCSS. 
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